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Abstract: Multi-channel systems appear in several fields of application in science. In 
the Synthetic Aperture Radar (SAR) context, multi-channel systems may refer to different 
domains, as multi-polarization, multi-interferometric or multi-temporal data, or even a 
combination of them. Due to the inherent speckle phenomenon present in SAR images, the 
statistical description of the data is almost mandatory for its utilization. The complex images 
acquired over natural media present in general zero-mean circular Gaussian characteristics. 
In this case, second order statistics as the multi-channel covariance matrix fully describe the 
data. For practical situations however, the covariance matrix has to be estimated using a 
limited number of samples, and this sample covariance matrix follow the complex Wishart 
distribution. In this context, the eigendecomposition of the multi-channel covariance matrix 
has been shown in different areas of high relevance regarding the physical properties of 
the imaged scene. Specifically, the maximum eigenvalue of the covariance matrix has 
been frequently used in different applications as target or change detection, estimation 
of the dominant scattering mechanism in polarimetric data, moving target indication, 
etc. In this paper, the statistical behavior of the maximum eigenvalue derived from the 
eigendecomposition of the sample multi-channel covariance matrix in terms of multi-channel 
SAR images is simplified for SAR community. Validation is performed against simulated 
data and examples of estimation and detection problems using the analytical expressions are 
as well given. 

Keywords: multi-channel systems; sample eigenvalues; Wishart distribution; MIMO; 
maximum eigenvalue; SAR 
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1. Introduction 

Multi-channel systems with random nature appear in a wide range of fields in the literature. For 
several of them, the central limit theorem applies and their random behavior can be modeled by Gaussian 
statistics, being thus statistically fully described by the first and second order moments. In the case of 
multi-channel SAR systems, the assumption of zero-mean multivariate complex Gaussian distribution is 
frequently valid for geophysical media, being thus fully described by the complex Hermitian covariance 
matrix. This is the statistical case treated in this work. 

Since a SAR signal corresponds to the superposition of the scattered fields from all scatters inside 
a resolution cell, physical parameter estimation may be performed by exploiting the multi-channel 
covariance matrix. For that, eigendecomposition theorems have been widely used in the literature, 
decomposing the whole covariance matrix into elementary quantities, in order to provide a better physical 
interpretation of the data. 

The eigendecomposition of the multi-channel SAR covariance matrix leads to useful information 
in a large range of SAR applications, including Ground Moving Target Indication (GMTI) [1], 
polarimetric SAR (PolSAR) [2], interferometric SAR (InSAR) [3], polarimetric interferometric SAR 
(PolInSAR) [4], change detection [5], target detection [6] and filtering [7,8]. Particularly, the covariance 
matrix maximum eigenvalue has been proved to be a key parameter in various areas. In [5] and [9], 
the change detection problem was elaborated using the maximum eigenvalue of the interferometric 
covariance matrix. The eigendecomposition of the SAR covariance matrix with application to SAR 
GMTI is used in and to calculate the probability of moving target detection in homogeneous and 
heterogeneous terrain. Regarding polarimetry, the eigendecomposition of the coherency (which is also 
a covariance) matrix is the most common tool among incoherent target decomposition theorems for the 
interpretation of the earth's surface. The maximum eigenvalue of the coherency matrix is also often 
used in polarimetry, due to the close relationship to the dominant scattering mechanism of the scattering 
process [2,10,11]. 

Although there is a wide area of applications regarding the maximum eigenvalue of the multi-channel 
covariance matrix, a certain lack of information in the literature concerning its statistical characterization 
in terms of multi-channel SAR may be verified. The statistical description can be useful to understand 
bias effects and to analyze estimation as well as detection problems. The estimated (sample) covariance 
matrix follows a Wishart distribution, which has been of interest along the years since its first derivation. 
In areas as communication systems, several works using such results have been published [12-16]. The 
application of the Wishart distribution in remote sensing was however considerably late introduced [17]. 

The cornerstone study considering the statistical description of the covariance matrix 
eigendecomposition in polarimetry has been carried out in [18]. The majority of the analysis in [18] 
was performed on the basis of numerical methods. In this paper the results of [18] is supported by 
addressing analytical solutions. Additionally, it is derived an exact closed form expressions for the 
Moment Generating Function (MGF) of the sample covariance matrix maximum eigenvalue. 

The validation of the derived expressions is performed using simulated data, demonstrating its 
agreement with the theory. The effect of the number of samples and underlying correlation scenario 
of the sample covariance matrix on the bias in the estimation of the maximum eigenvalue is also 
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investigated. Finally, examples of applications are as well given, in the fields of estimation and detection 
theory. 

The next section reminds the basics of the statistical description of multi-channel SAR systems and of 
the covariance matrix eigendecomposition. It also includes the derived theorems presenting the statistical 
founds. Section 3 includes numerical examples validating the theorems as well as an analysis of their 
behavior. Sections 4 and 5 show their usage for the analysis of the estimation bias and for the elaboration 
of detection problems, respectively. Section 6 concludes the paper with discussions and directions for 
future work. The detailed statistical derivations can be found in the Appendix. 

2. Statistical Characteristics of the Maximum Eigenvalue of the Multi-Channel Sample 
Covariance Matrix 



2.1. Preliminaries 



The individual elements of a m-dimensional multi-channel system may be organized in a 
m-dimensional vector k. As the elements are assumed to follow zero mean complex Gaussian 
distributions, the vector k is said to follow a m-dimensional multivariate normal distribution, 
with zero mean and true covariance matrix £ among the vector elements, and is represented by 
k ~ A/^(0, £) [19,20]. For zero mean Gaussian statistics the covariance matrix fully describes the 
data, playing a key role in several application fields. In practical situations however, the true covariance 
matrix E is unknown and has to be estimated by its maximum likelihood estimator (MLE), the sample 
covariance matrix Z = (1/n) Yuj=\ kjkj\ where n is the number of estimation samples and f is the 
transpose conjugate operator. In the SAR context, the number of independent samples is also called 
looks. The elements of Z follow a m dimensional complex Wishart distribution with n degrees of 
freedom and true covariance matrix E, represented by Z ~ W^(n, E) and defined as [20]: 
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where T(-) is the gamma function and etr(-) is the exponential trace of a matrix. 

The spectral theorem from linear algebra allows the decomposition of the full rank m-dimensional 
covariance matrix in a set of m one-rank covariance matrices using its eigenvalues and eigenvectors. 
Accordingly, the decompositions of the true covariance matrix E = YlT=i^( e i e \) an( ^ * ts estimator 
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with their real non-negative eigenvalues k and Aj, and respective complex eigenvectors e*, e', for 
i = {1, ...m}. 
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2.2. Sample Maximum Eigenvalue Statistical Description 

The following Theorem III concerns the derived Moment Generating Function (MGF) of the 
maximum eigenvalue of the sample covariance matrix Z of a multi-channel statistical system, which 
is a critical step in removing the bias of the largest eigenvalue. The Theorem III, which is derived due to 
the previous works (Theorems I and II), is the main result of the paper. It is detailed in the appendix and 
will be used for the elaboration of illustrative estimation and detection problems in Sections 4 and 5. 

Throughout the paper, | • | is the matrix determinant, (X) n = (1/n) Y^h=i Xi denotes the estimator of 
the random matrix X formed from a sample of size n. 

Theorem I: Let k ~ A/"^(0, S) be a m-dimensional complex vector whose elements follow zero 
mean Gaussian distributions with associated m x m covariance matrix E. Let S have l m < .... < l\ 
eigenvalues. Then the Cumulative Density Function (CDF) of the maximum eigenvalue \ ma x of the 
sample covariance matrix (kk^) n , with the assumption m < n, is given by 

_m(m— l)„m(2n— m+l)/2 FT™ 1 k m ~ ^ 

S\V(x)\, with constant term S= = = — -, r (3) 

r m (m)r m (n) rj™iMr ( -~- 



Kj \ L It 
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where ^(x) is am x m matrix with its (i, j)th element ^(x)i j = ,„w n+1 _^ , and 7 is the incomplete 
Gamma function (Equation (2.42) in [21]). 

Proof: [14] gave the closed expression of CDF of the largest eigenvalue of the complex Wishart 
matrices. Here, it is written in the form of SAR covariance matrix after small continuation, see also 
Appendix A. 

Theorem II: Let k ~ A/"^(0, S) be a m-dimensional complex vector whose elements follow zero 
mean Gaussian distributions with associated m x m covariance matrix S. Let £ have l m < .... < h 
eigenvalues. Then the Probability Density Function (PDF) of the maximum eigenvalue \ max of the 
sample covariance matrix (kk^) n , with the assumption of m < n, is given by 

Px max (x)=S\^(x)\tr[^(x)- l n(x)] (4) 

where Q(x) is an m x m matrix with its i, jth elements Q(x)i t j = exp ^— fx^j x n ~^, and ^/(x) and S are 
defined in Equation (3). 

Proof: Equation (4) is obtained by differentiating Equation (3) with respect to x using (Equation (9) 
in [22]) 

f t \X(t)\ = \X(t)\tr(x(t)- 1 ^X(t)y (5) 

Here it can be noted that when the true covariance matrix S is diagonal, the PDF of the largest eigenvalue 
reduces to Ermolaev and Rodyushkin result [23]. 

Theorem III: Let k ~ A/"^(0, S) be a m-dimensional complex vector whose elements follow zero 
mean Gaussian distributions with associated m x m covariance matrix E. Let £ have l m < .... < l\ 
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eigenvalues. Then for any positive integers s, the sth moment of the maximum eigenvalue A max of the 
sample covariance matrix (kk^) n , with the assumption of m < n, is given by 

771 771 — 1 

E(K ax ) = Sj2(-iy* s ^ II „ + / ( ,A ^) 5 =^a^(b + Ar- M T(v + M)x 
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where 5 indicates the constant term as in Theorem I and • • ) is the hypergeometric function of 
several variables with a k = f^, u k = (2 + n — n sub (k))/2, v = l + s + n — i + YlT=i n ~ ns ^ b( ' k \ 
b = f. + YJk=i it' ^ = n ~ 7Ts " b(k \ M = YIk=i u k , A = I YlT=i a k- Here > m e sum is computed over 
(m — 1)! permutations of 7r s . S'm denotes the set of all ml permutations of the set S = {1, 2, m}, and 
sgn(7r s ) denotes the sign of the permutation tt s : +1 if n s is an even permutation and —1 if it is odd. 

Proof: See Appendix section B. In the appendix the moment of sample maximum eigenvalue is also 
given in more friendlier form for programming obtained by splitting the hypergeometric function of 
several variables into a sum of its variables. 



2.3. Dependence of the Covariance Matrix Eigenvalues 



Before starting the following analysis, it would be nice to illustrate the dependencies of the 
eigenvalues on the covariance matrix parameters, which is necessary for the further understanding of the 
addressed topics. For that, a two-dimensional covariance matrix is used, originated from the expected 
values of the outer product of the vector k = [ki k 2 } T , i.e., 
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where a\, erf are the variance or power of k\ and k%, respectively, p is the absolute and 0 the phase of 
their complex correlation coefficient, and T means transpose. 

The eigenvalues of the true covariance matrix can be shown to be given by [5]: 
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The behavior of l\ and l 2 is presented in Figure 1, as a function of a normalized power ratio b and a 
correlation p. Note their mutual dependence, making clear that b and p directly determine the behavior 
of both eigenvalues. 
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Figure 1. Two-dimensional system configuration, (a) True eigenvalue l x and (b) true 
eigenvalue l 2 as a function of correlation and normalized power ratio b. (c) Counter plots 
of U. 




(a) (b) (c) 

3. Validation and Analysis of the Theoretical Expressions 



This section aims to validate the theorems mentioned in the previous section using simulated 
data. The simulated data have been generated using different multi-dimensional configurations, 
where the correlation between channels have been generated using the well known Mahalanobis 
transformation [24]. 

Figure 2. (a) Comparison between the theoretical distributions and the histograms obtained 
from simulated data of the maximum sample eigenvalue, (b) The distribution of the 
maximum sample eigenvalue as a function of the number of samples, for a three-dimensional 
system. In both case, the powers are given by a kl = a k2 = o ki = 1 and correlations by 

Pkik 2 = 0, p kl k 3 = 0.8 and p k2k3 = 0. When n -)• oo, X max = h = 1.8. 




(a) (b) 

Figure 2(a) shows the comparison of the Equation (4) with simulations. The theoretical PDF curves 
clearly agree with the histograms obtained from simulated data. As expected, the PDFs become narrower 
with increasing n, indicating less variance around the true value of the maximum eigenvalue Z x = 1.8. 
This behavior can be better seen in Figure 2(b) where the distribution of X max as a function of the 
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number of estimation samples n is presented. Note also that the expected value of the distributions 
seems to change for different n. 

Figure 3(a) shows the variation of the histogram mean as a function of n for a two-dimensional 
case with fix correlation p = 0.2. In Figure 3(a), the theoretical expected value of \ ma x has been also 
over plotted for comparison. The curves match well, which validates Theorem III. The same has been 
carried out for the second order moment, i.e., the variance of \ max , and is presented in Figure 3(b). The 
agreement between the theoretical variance of \ ma x and the variance of the simulated data can again be 
confirmed. Also, as already observed in Figure 2(a), the lower the number of samples n, the higher the 
variance of X max . 

Figure 3. Theoretical versus simulation results (a) for the first order statistics (mean), 
and (b) for the second order statistics (variance), of the sample maximum eigenvalue of a 
two-dimensional system with powers a kl = a k2 = 1 and correlation p klk2 = 0.2. 
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The impact of the correlation and the number of samples on the behavior of the third (skewness) 
and fourth (kurtosis) order moment of the maximum sample eigenvalue X max is presented in Figure 4. 
Skewness is a measure of how symmetrical the distribution is with respect to its mean. Figure 4(a,b) 
indicates that the skewness of \ m ax converges to zero for increasing n and increasing p, expressing the 
tendency to a symmetrical distribution in that cases. 

Kurtosis is a measure of the peakedness of the distribution. Figure 4(c,d) shows that for increasing 
n and p, the kurtosis of the \ m ax distribution tends to three, which corresponds to the kurtosis of the 
normal Gaussian distribution. 



4. Estimation Bias 

Figure 5 shows the expected value (first order moment) of the maximum sample eigenvalue \ m ax as 
a function of the true eigenvalues l max for a fixed number of samples n = 3 and k — 1. The variation of 
the underlying correlation of the true covariance matrix p, which changes the values of l rnax , has been as 
well indicated in a color code. Observe that the higher the correlation is, the higher l max becomes. 
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Figure 4. Theoretical results for the third (skewness) and the fourth (kurtosis) order statistics 
of the sample maximum eigenvalue of a 2D system having powers a kl = <J k2 = 1 and 
correlations p = {0.2, 0.3, . . . , 0.9} versus the number of samples n = {2, 3, . . . , 62}. 
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Figure 5. Effects of the correlation between channels on the expected value of the maximum 
sample eigenvalue keeping n = 3 and k — 1. 




The Was of the estimator of a certain parameter is defined as the difference between the expected 
value of the estimator and the true value of the parameter. In the present case, the bias is hence given by 



Sensors 2012, 12 



2774 



E[\max] — Imax- Notice thus from Figure 5 that the bias of X max becomes very strong for low values of 
the correlation l max (or equivalently, low values of p). 

The effect is emphasized in Figure 6(a,b), where the bias of X max is presented as a function of l max 
and l 2 , for n = 3 and n = 16, respectively. The values of the underlying correlation p have been also 
indicated. The bias is small either when the correlation is high or the number of samples is sufficiently 
large. When the correlation between channels is low, the bias becomes very significant and a large 
number of samples is necessary to decrease the estimation bias. 

Figure 6. The bias, E[X max ] — l max , of the sample maximum eigenvalue with the 
number of samples 3 and 16 in various correlated channels having a standard deviation of 

Ofci = <Jk 2 = 1- 



p = 0.1 




(a) n=3 (b) n=16 



5. Analysis of Detection Problems 

Detection theory is a means to quantify the ability of a procedure to detect a parameter (or signal) 
immersed in a noise environment. A decision has to be taken, in order to say yes, there is a signal, or no, 
there is no signal. Such decision is usually taken under the application of a decision threshold. Noise 
contributes off course negatively inducing wrong decisions. For the quantification of detection accuracy, 
some quantities are usually defined as the probability of detection (PD) and probability of false alarm 
(PFA), allowing a detection problem analysis. 

The maximum eigenvalue of covariance matrices has been frequently used in the literature for the 
elaboration of certain detection problems. In the Ground Moving Target Indication (GMTI) area, for 
instance, the signal to clutter plus noise ratio has been studied under the distribution of the sample 
eigenvalues, which allowed the implementation of a constant false alarm rate detector [9]. The 
eigenvalues of the covariance matrix of a SAR image pair have been also used in order to formulate 
a problem in the change detection area [5]. Another example is the determination of the existence 
of just a single dominant scattering mechanism in a SAR polarimetric acquisition, which also has 
been evaluated using a threshold in the maximum sample eigenvalue of the polarimetric covariance 
matrix [11]. Target detection and polarimetric filtering represent other fields of application in which the 
maximum eigenvalue can be used. 
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Having the closed form expressions of the PDF (Equation (4)) and/or CDF (Equation (3)) of the 
maximum sample eigenvalue of the covariance matrix, the PD and PFA when applying a threshold in 
X max can be analytically computed, allowing a complete detection problem analysis. 

5.1. Detection of a Dominant Maximum Eigenvalue 

Since several detection problems rely in fact on the choice of a dominant or non-dominant l max by 
applying a threshold in X max , the following problem with two hypotheses is elaborated 

H 0 (lmax is dominant) : l max = h and k = 0 
Hi {lmax is not dominant) : l max = l x = k 

for i — {2, 3, . . . , m}. 

In one application area H 0 may mean that just a single scattering mechanism is present inside a 
resolution cell of polarimetric SAR data, in other application H 0 can mean that a change happened or a 
moving target is present in the scene. 

Since l max is not achievable, a threshold T has to be used in X max , originating the following PD and 

PFA 

PD = p [accepting H 0 \ H 0 is true] 

/oo 
PA ma , (x; H 0 ) dx 

PFA = p [accepting H 0 \ Hi is true] 

/oo 
PXmax ( X , Hi) dX 

as a function of the decision threshold T. 

Hence, for a two-dimensional system configuration, the distribution p\ max (x; H 0 ) is the distribution 
of Xmax when li = 2 and / 2 = 0. For a three-dimensional system, p\ max (x; H 0 ) is determined evaluating 
the distribution of X max when l t = 3 and l 2 = I3 = 0. Higher dimensional system configurations follow 
the same rule. 

On the other hand, the distribution p\ max (x; Hi) is given by h = l 2 = 1 and h = l 2 = h = I for a 
two- and three-dimensional system, respectively. 

For each value of T, there exists a pair (PFA, PD). The curves of PD versus PFA are called 
Receiver Operating Characteristic (ROC) curves and express the detection performance. The more a 
ROC curve bends toward the upper left, the better is the detection performance since a higher PD and 
lower PFA is achieved. 

Figure 7(a) shows the detection performance as a function of the dimension of the multichannel 
system, i.e., m = 2, e.g., interferometric, m — 3, e.g., polarimetric and m = 6, e.g., 
polarimetric-interferometric system. For all cases, with fixed number of samples n = 6, one can realize 
that the performance of detection is significantly improved as the number of SAR images increases. 
Figure 7(b) shows the ROC curves for m = {2, 3} and for different number of samples n. It can be 
seen that for both multidimensional system configurations the number of samples increases the detection 
performance. 



Sensors 2012, 12 



2776 



Figure 7. Maximum sample eigenvalue detection performance as a function of the number 
or channels m, samples n and correlation p. (a) ROC curves for different multidimensional 
systems, m = {2, 3, 6} and n = 6. (b) ROC curves for the two- and three-dimensional cases, 
and for different number of samples n. (c) ROC curves for the two-dimensional case, n — 3, 
and for different correlation. 




(a) (b) (c) 

In order to state how correlation effects the detection performance, the detection problem is 
reformulated as follows 

#o (Imax is dominant) : l max = l x > l 2 > l 3 , . . . , > l m > 0 
H x (l max is not dominant) : l max = h = k i = {2, 3, . . . , m}. 

In this way, every time that correlation is greater than zero the eigenvalues have different values, and 
one is larger than the others. For the two-dimensional case, for instance, P\ max (x; H 0 ) is the distribution 
of \max when li > l 2 , which changes for different values of the correlation p. Figure 7(c) presents the 
ROC curves for this case. For small correlation between channels, the detector suffers from a significant 
false-alarm rate. As expected, when correlation increases the ROC curve bends toward the upper left, 
indicating better detection performance. The limit is reached when p — 1, meaning that l\ — 2 and 
h = 0, which corresponds to the cases of the previous detection problem. 

5.2. Target Detection Using Polarimetric Matched Filter 

In this section the application of the expressions for the detection of specific targets using the 
Polarimetric Matched Filter (PMF) concept is illustrated. For that, a short review on PMF is 
required [6,7]. 

In a polarimetric acquisition, the dimension of the system corresponds to the number of channels, 
which are in general four, but three for the backscattering reciprocal case. The measured vector k is 
hence three- or four-dimensional. The elements of the vector k may be weighted in a given way, in order 
to satisfy a certain condition. The weighting can be accounted for by making y = h^k, where h is a 
complex vector with same dimension as k. A condition usually aimed to be satisfied in the literature is 
the maximization of the quadratic detector \y\ 2 (Equation (59) in [6]). The optimal weighting vector h 
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in order to detect a distributed target with the target vector k is dependent on the target characteristics. 
Accordingly, the expected value of \y\ 2 is given by 

E{\y\ 2 } = E{h ] k(h ] k) ] } = h ] ^ t h (9) 

where E t = E{kk^} is the target covariance matrix. Regarding Rayleigh quotient (Theorem 15.91 
in [25]), for any m-dimensional complex vector x and a given m x m Hermitian matrix A, x^ Ax < 
\\x\\ 2 a max , where a max is the maximum eigenvalue of A. The equality is valid if x is along the direction 
of the a max eigenvector U max (\\U max \\ = 1). Hence, for a given distributed target with covariance matrix 
E t , the optimum weighting vector h is given by the eigenvector of the maximum eigenvalue of S t . 

The target vector k is a random sample and has alone no physical meaning. Therefore, usually 
multi-look processing is performed making, for n samples (or looks) 

■n 

1=1 

The target is assumed to be immersed in polarization independent clutter. In this way, the detection 
procedure is thus evaluated by choosing h as the eigenvector corresponding to the maximum eigenvalue 
of the covariance matrix T, t of the target to be detected, and applying a detection threshold y 2 > T. 

In the presence of target y 2 is maximum, given by y 2 = || h\\ 2 X max , where h is the eigenvector of A max . 
Hence, assuming without loss of generality that h has unitary length (\\h\\ 2 = 1), detection performances 
can be made using the derivations given in this work, as done in the previous section, where a threshold 
has been used in order to detect a dominant maximum sample eigenvalue (X max = y 2 > T). 

Note that the probabilities of detection and false alarm defined in last section can be more simply 
evaluated by 

poo pT 

PD = / Pa_ (x; H 0 )dx = l- Pw (x; H 0 ) dx = 1 - F Xmax (T; H 0 ) 

JT J-oo 



(11) 



PFA =1- F Xmax (T; Hi 



A three-dimensional polarimetric target detection problem was formulated making k = [ShhShv 
Svv] i where Si is the polarimetric scattering matrix element in channel i, and using the target 
covariance matrix structure 

1 0 PV ^" 

E t = S HH 0 2e 0 (12) 
. P*Vl 0 7 

where e = ^|Jg^j 2 | , 7 = ^|||^j 2 j and p is the complex correlation coefficient between Shh and 
Sw [6,26]. Figure 8 shows the ROC curves for different number of samples for three different kinds 
of targets. Figure 8(a) corresponds to an azimuthal symmetric target having covariance matrix with 
parameters e = 1, 7 = 0.5 and p = 0. Figure 8(c) corresponds to a reflection symmetric target having 
covariance matrix with parameters e = 1, 7 = 0.8 and p = 0.8, while Figure 8(b) to a reflection 
symmetric target with parameters e = 0, 7 = 0.8 and p = 0.8 in its covariance matrix. For all three cases, 
Shh = 1 and the PFA was evaluated using the polarization independent clutter having e = 1, 7 = 1 
and p = 0. Note that the curves vary not just with the number of used samples n but are also different 
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for different targets having different S t . This means that some types of targets are easier to detect than 
others, when using the quadratic detector described here and when the clutter is polarization independent. 
When the target covariance matrix is similar to the one of the clutter, the detection performance weakens 
(Figure 8(a)). On the other hand, when the covariance matrix of the target is significantly different from 
the clutter one, the detection performance improves (Figure 8(c)). 

Figure 8. Detection performance of three different distributed scatterers using the PMF 
concept for different number of samples, (a) Azimuthal symmetric scatterer with e = 1, 
7 = 0.5 and p = 0. (b) Reflection symmetric scatterer with e = 1, 7 = 0.8 and p = 0.8. 
(c) Reflection symmetric scatterer with e = 0, 7 = 0.8 and p = 0.8. The PFA is evaluated 
using the polarization independent clutter with e = 1, 7 = 1 and p = 0. 




False alarm rate False alarm rate False alarm rate 



(a) (b) (c) 

6. Conclusions and Discussion 

In this paper a depth statistical analysis of the maximum eigenvalue of the eigendecomposition of the 
sample covariance matrix in terms of SAR applications is presented. The proposed analysis are supported 
by simulation results via several examples. The results are based on a exact closed-form expressions of 
PDF, CDF and MGF In this study, existing density functions of the sample maximum eigenvalue were 
extended and/or implemented into multi-channel SAR system in order to obtain a simple expression 
of the sample eigenvalues giving a way to fruitful applications. From these closed-form expressions, 
it has been possible to develop new algorithms for unbiased calculations of parameters extracted from 
multi-channel SAR covariance matrix. In addition to these implementations, closed-form expressions 
were developed for the MGF of the sample maximum eigenvalue, which can be critical in the area of 
bias removal and detection performance analysis. This new closed-form expressions of the MGF can be 
also interesting for other application areas like MIMO systems (Multiple-Input Multiple-Output). Apart 
from estimation theory analysis including the MGF, the detection problem of the sample maximum 
eigenvalue has been also discussed. 
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Appendix 

This Appendix starts with the reminder of some basic results from linear algebra and statistical theory 
to prove the Theorem III proposed in this manuscript. 

Definition 1 (Theorem A3 in [19]): The determinant of a square m x m matrix A is 

m 

\A\ = sgn(7r)ai fcl a 2 fc 2 ...a mfcm = ^ sgn(7r) a iki (Al) 

7T 7T 1=1 

where J2w denotes the summation over all m! permutations, and sgn(7r) denotes the sign of the 
permutations determined by (— where ir is the permutation symbol. Notice that the determinant 
is a combination of the determinants of its sub-matrices, and permuting the columns or rows of a matrix 
changes the sign of the determinant related to the permutation. 

Definition 2 (Corollary 2 in [27]): Given two arbitrary m x m matrices and ty(x) with ijth 
elements &i(xj) and ty^Xj), and an arbitrary function £(•), the following identity holds: 



$(x)| Y[^,(x k )dxidx 2 ...dx r 



b 

®i(x)if?j(x)£(x)dx 



(A2) 



where the multiple integral is over the domain V = {a < x m < ■ ■ ■ < x 2 < X\ < b}. 
Proof of Theorem I 

In the case m < n, the sample covariance matrix has full rank and follows a central Wishart 
distribution. It can be noted that [28] is an interesting work concerning the case n < m. In Wishart 
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history, the joint PDF of the real ordered sample eigenvalues oo > Ai > . . . > X m > 0 of Z is given by 
Equation (36) in [18] 



Ps(H) 



7T 



m(m— l)„m(2n— m+l)/2 



exp — n 



rn-l rrm \n— m rrm /\ \ 

"Q fc m I I ' I I ./ (A ' ~~ A 



ij=l fc=l 



i _ i 



(A3) 



Y m {m)Y m {n) 

with s = g H (z) n g 

where H = diag{Ai, . . . , A m }, ps(S) is the joint PDF of the eigenvalues of the sample covariance matrix 
oo > Ai > . . . > A m _i > A m > 0 and oo > l\ > ... > l m -\ > l m > 0 are the eigenvalues of the true 
covariance matrix. 

The CDF of the maximum eigenvalue X max — Ai is obtained using 



Fx (x) 



p(X m < x, . . . , Ai < x) 
p(\x, X 2 , A m )dAi...dA r 



(A4) 



v 



where p(Ai, A 2 , A m ) is the joint PDF of the eigenvalues as defined in Equation (A3), and 
V = {0 < X m < ... < Ai < x}. Before substituting Equation (A3) in Equation (A4) it is desirable 
to write a friendlier expression for the joint PDF Equation (A3). If the constant is denoted by S as in 
Equation (3), then Equation (A3) can be written as 

\ \ tn 



P3 (h) = s n xr m n - A i) ^ ( - n T 

i=l i<j ^ 1 ' 



(A5) 



Due to the definition of the Vandermonde determinant [25] the multiplication of YYiLi A" m an ^ 
UT<j ( A * ~ A i) is: 



n—m 
1 


0 


0 


0 




0 


0 


0 


\n—m 



m— 1 



A™ 



m— 1 



\ m—m 
A m J 



a; 



n-l 



Ar 



\ n— 1 
. m 



\ n— m 



-> A ■ 



U,i=l 



(A6) 



Then the joint PDF of the sample eigenvalues can be written in the form of 

^ = s\xr j \ 



Pe 



i '" 

\i,j=l 



exp — n 



(A7) 



By applying the result of Definition 2 and Definition 1 into (A4) with ^i(Xj) = A™ 3 and $i(Aj) 
exp ^— n^-j , it yields 



fx) =S 



X n ~ j exp(-n^)dA 



(A8) 



Finally, solving the remaining integral using Equation (351) in [25] yields the desired closed-form 
expression of the sample maximum eigenvalue CDF of multi-channel covariance matrix 

/ \ (j—n— 1) 

I Tl \ TX 

Fx max [x) = S\^{x\ 3 \ with *(x) i>j =[ T ) j(n + l-j,x-) (A9) 



where j(a, x) = j Q c x a 1 e x dx denotes the lower incomplete gamma function. 
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Proof of Moments of the Sample Maximum Eigenvalue 

For any positive integer s, the sth moment of the maximum eigenvalue is obtained using 

poo 

E(X s max ) = / x s p Xma ^x)dx (A10) 
Jo 

where p\ max (x) is the PDF of \ ma x- It has been shown in Equation (4) that 

Px ma M = S|¥(s)|tr [*(x)~ 1 fi(x)] (All) 



exp 



fx)x n j and 



where fi(x) and ty(x) are m x m matrices with ijth element 

/ \ -(n+i-j) 

^(x)ij — \f.) j{n + 1 — j, xf;), and 5 denotes the constant terms given in Equation (3). 

It is desirable to write a friendlier expression for Equation (All) to be useful for deriving the moments 
of the sample maximum eigenvalue. Due to the definition of the inverse of the m x m matrix A^ 1 = 
CfJ\A\, Equation (All) can be written in the following form 



Pw(*)=Str [C T (x)Q(x 



(A12) 



where C T (x) is the transpose of the cofactor matrix ^(x). Since the trace of a square matrix is the 
sum of the diagonal elements of the matrix, Equation (All) can be written as the sum of the diagonal 
elements of D(x) = (C T (x)tt(x)) . 



s 



x s tr [C T {x)Q{x)} dx 



POO 

S / x'tr[D]dx 
Jo 



(A13) 
(A14) 



Now the problem in Equation (All) is the evaluation of the diagonal elements of the D(x). The 
matrix D(x) is the product of the two square matrices and the trace of the product of two matrices is 

m 

tr(C T f2) = CjiVtji = Cn^ii + • • • + C m iVl m i + • ■ ■ + Ci m Qi m + ■ ■ ■ + C mm ^l mm shown as 



/ 



tr 



On C21 
C12 C22 



\ [ylm 



2 m 



C m i 

C m 2 

Crn.m. 



e x 'ix n 1 e x 'ix n 2 



e x 'ix n 1 



x hx n ~ 1 



e~ x T^x n ~ l 



x h x n ~ 2 



e lm x 



'2£ 



n— 2 



\ 



(A15) 



J7 



where Cji is the determinant of the sub-matrix Mji obtained from ty(x) by removing its jth row and ith 
column. Then, the j, i th cofactor of ty^, denoted by Cji, is Cy L = (— |Mfj|. Now, Equation (A14) 
is: 

m poo 

E(X s max ) =$Y, ^CjiVjAx (A16) 



where Cji can be extracted from ty(x) by Definition I as follows, 



C 



7r sub^^m 



fc=i 



if k = j A tt s (J) = i, tc s e 7l s 



(A17) 
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Here, the sum is computed over (to — 1)! permutations of ir s . S m denotes the set of all to! permutations 
of the set S = {1,2, ...,m}, and sgn(Ti su b) denotes the sign of the permutation tt s : +1 if n s is an 
even permutation and —1 if it is odd. After interchanging the order of the summations and integral in 
Equation (A 16), the integration reduces to the following form 



E(X s max ) = S E(- 1 ) <+j I ^ ex P {-J. x ) x% 



/ m \ 

1 sgn(7r^)n7(^ + l-vr sub (fc),a ; ^)(^)—( fc )— ^ 



k=l 



7(71 + 1 



m /»oo 

5^(-l) i+J £ sgn(7w,) / x s exp 



— — x l ./•' 



x 



nr=i7(^ + l-vr sufe (fc),x^)(/ 



ri \/ n_ \iT sub (k)-n-l ' 



dx. 



(A18) 



7(n + l -i,x±)(*:) i - 1 - n 

It can be seen from Equation (A18) that the integration includes the multiplication of (to — 1) times 

incomplete gamma functions with exponential and powers. After applying identities (ex. 10.3, 
Equations 10.15 and 10.65 in [21]), 



n— 1 —ax 



7 (n, ax) — -(ax) ? e ? Ain^i n(ax), | ^ — 1, — 2, — 3, .. 



7 n + 1 - n sub (k) : j-x 



n+l-TT sub (k) \ l k 



2 ' 2 

n — 7r 

x)~ 



a solution of integration in Equation (A 18) may be obtained by using the following formula 



(A19) 



3 e 2 'fc M n-* aubW n + i-, sub(k ) (^x), (A20) 



x 



v—l„—bx 



e- b *M x i(x ai ) . . . M XmtU Uxa m )dx = af . . . <T (6 + A)~ v ~ Y(v + M)x 



F A [v + M; m - Ai, u m - A m ; 2u u 2u r 



ai 



(A21) 



b 1 + A , '" , b m + AJ ' 

where M = uH hw m andA = (l/2)(aH ha m ) are valid for 3?(M+v) > 0, and U{b±\ai±- ■ • + 

|a n ) > 0. Here Aik, m (x) is a Whittaker function of the first kind Equation (7.622) in [25] and Fa{ol : 
/3 m ; 71, . . . , 7 m : Zi,..., z m ) is a hypergeometric function of several variables Equation (9.19) 
in [25]. After replacing the integration part of Equation (A18) by Equation (A21), it can easily be shown 
that for m dimensional SAR system with n > m, the moments of the sample maximum eigenvalue is 



m 



\i+3 



E(X s m J = s ^(-i 

k^iAir sub (k)^j 



^2 sgn(vr sufe ) 



1 



POO 771 1 

o U n + 1 -^subikyik 



i+n— i+ 



,n n 



2 exp(-x(- + r r ))A^ n-^(fc] n+i-^ h(fc ) (x-)dx 



21 
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s ) 

max J 



in m— 1 -. 



L n + 1 - 7r sufe (A;) / fe 



2 a? fc x 



(b + A)~ v ~ M r(v + M)xF A [v + M;u 1 -X 1 ,..., u m ^ - A m _i; 2u 1; . . . , 2^-, 



a i 



h + A' 



l m—l 



bm-1 + A 



(A22) 



where a k = f-, u k = (2 + n - 7r su6 (£;))/2, w = l + s + n- z + XX=i 



rn-l n-TT sub (k) 



n-7r SM j,(fc) 



' _ «j ^k=l 2l k ' 

M = YlT^i u k an d ^4 = | XlfclTi 1 a fc- Notice that and 7r sub (k) takes values from 



(fc, 7r su6 (A;) = tt su6 (A;) 6 {1, 2, . . . , m} , k ^ j A ir sub (j) ^ i}) related to given 

Although Equation (A22) is an exact closed-form expression for the MGF of the sample maximum 
eigenvalue, the hypergeometric function of several variables is in general very difficult to evaluate, 
and it is better to express Equation (A22) in a more tractable form for future analysis. Regarding 
Equation (2.53) in [21], the incomplete gamma function can be written in the closed form 



n , 



j(n + 1 - n sub (k), x—) = {n- 7r sub (k))\ 



1 — exp ( —x 



\ 



>(*) 



V 



E 

t=0 



X-, 



»v\ 



J 



\ 



J 



(A23) 



for n E Z + , which is a constraint fulfilled by our problem. For the sake of simplicity, f(k, n S ub(k)) 



^ n—w, 



exp l —x 



h 



\ 



E 

t=0 



nb(k) | x _ 



\ 



t\ 



, and Y[(n - ir sub (k))\ (l - f(k, Tr sub(k) )) = JJ a k \ (1 - f k ) indicate 

fc=i fc=i 

m times multiplications of incomplete gamma function. Then if this multiplication is extended as in the 
form 



/ 



/ (?) (?) 

Y[a k \ i+^(-i)v fel+ j2 (-i) 2 / fcl / fc2 + --- + 



k=l \ fci=l 



fc 2 =2>fci=l 



E (-1)™ fkjk 2 - ■ • fk, 

k m =m>--->k2=2>ki=l 



and the integration can be easily implemented into Equation (A 18) as follows 



? 

/ 



(A24) 



m „ c 

E(\ s max ) = s j2(- i r +1 E ^s Ub )j o 



x s+n 'exp [ ~y x 

i / \k=i 



Y[{n - n sub (k))\ 
k=i 

(l - f(k,n suKk) ))) /^jidx 



(A25) 
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By analyzing that the term ^ in the determinant of Equation (A25) reduces to (m — 1) multiplication 
of incomplete gamma functions: 



/ 



tf(A^)=S]C(-l)*-' E II a * ! 

ij = l 7r 3 ubG5 m fc=l 



x exp 



E 



k m -i=m— l>--->fc2=2>fci = l 



/fci/fca • • • /fcm-i^ exp f - -z ) x n Mx 



n 



x 
\ 
J 

(A26) 



Applying the rule J 0 °° x n e ux dx = nlu n 1 in Equation (A26) it yields: 



m— 1 



£(ALJ=<S E(-1)^ E sgn(7r su6 ) JJ 
i,j=l 7r SU 6e5 m fc=i 



/ 



(n — i + sy. 

\n— i+s+1 



( m i X ) ri-7r stl6 (fci) f JL 



n-7Tsu6(fcm-l) 

E 

"■771 — 1 



+ E E 

ki=l t kl =0 

, ^^m — 1 



(s + n - z + tjfcj! 



+ , 1 / \ s+n-i+tk , +1 

kfcl • j n | _n_ 1 



+ . . . + 



^m— 1 



n-7r s „i,(fei) [ 



E 

^=0 



+ + H htfc m _J! 



fel ■ f n j n 



s+n-i+t fcl H Mfe m _j +1 



(A27) 

The sth moment of sample maximum eigenvalue can finally be written in a closed-form expression as 



s ) 

max I 



^E E s S^sub)G(k,n sub (k)), if k = j A ir k {j) = h TTfc e ix sub (A28) 



m— 1 



<3{k,ir sub (k)) = ] J(// - 7r.,„,,(A-))! 



(n — « + s)! 

' n\n— i+s+l 



(A29) 



fc=i 

/ m— 1\ 



I J n-7r SIli) (fci) [ JL 



+ E E 



[s + n-i + t kl )\ 
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The moments of the sample maximum eigenvalue regarding its PDF is solved. For larger m 
dimensional systems, such an approach may become complicated due to the need for a large number 
of calculation of cofactors. However, for applications where only a few eigenvalues, like polarimetry 
(m = 3) and interferometry (m = 2), are of interest, the numerical calculations are quite rapid and 
stable. For example, in the simplest case of m = 2, n > 2, the MGF is given by 




if ir k (j) = i, n k e n sub , and if k ^ j and l nsub (k) t"H k = 1,2 

= ST(2n)r(n - 1) f^yY" 
\nii + nl 2 J 

Yl sgn(n sub )(n+l-k) 2 F 1 (l,2n-,n + 2-k-,-—^ ) J . 

(A31) 



where the constant S = ffaj]ir(^PiT(^i) for m = 2 and 2-^1(0, b; c; x) is a Hypergeometric function 
Equation (9.8) in [21]. 

© 2012 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article 
distributed under the terms and conditions of the Creative Commons Attribution license 
(http://creativecommons.0rg/licenses/by/3.O/). 



